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(57)Abstract: 

PROBLEM TO BE SOLVED: To provide the face image registering device 
which can exclude the registration of an unnecessary face image without 
placing any burden on a user and also increase dictionary precision. 
SOLUTION: An information terminal device 10 with security consists of an 
image input part 1 1, a face area extraction part 12, a state recognition part 
13, a recognition part 14, a dictionary generation part 15, an access control 
part 16, dictionary images 17, and a specific operation detection part 18. 
After the specific operation detection part 18 detects specific operation of 
the user, the face area extraction part 12 extracts an image of a face area 
from an image inputted from the image input part 11 and a dictionary 
generation part 15 judges whether or not the image is valuable enough to 
register as a dictionary image 1 7. 
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* NOTICES * 

Japan Patent Office is not responsible for any 
damages caused by the use of this translation. 

1 .This document has been translated by computer. So the translation may not reflect the original precisely. 
2.**** shows the word which can not be translated. 
3. In the drawings, any words are not translated. 



CLAIMS 



[Claim(s)] 

[Claim 1] In the face image registration equipment which has a dictionary means by which a person's face image is 
registered A face field extract means to extract the input face image which is an image of said person's face field out of 
the image inputted by image input means to input said person's image, and said image input means, A decision means to 
judge whether the input face image extracted by said face field extract means is a face image which can be registered 
into said dictionary means, Face image registration equipment characterized by consisting of a registration means made 
to register into said dictionary means with said decision means by using as said face image the input face image judged 
that registration is possible. 

[Claim 2] Said decision means is face image registration equipment according to claim 1 characterized by extracting the 
characteristic quantity of a face from the input face image extracted by said face field extract means, comparing this 
extracted characteristic quantity with the criteria set up beforehand, and judging that registration is possible if said 
characteristic quantity is said beyond criteria. 

[Claim 3] It is face image-registration equipment according to claim 1 characterized by to judge whether it has a 
detection means of operation detect predetermined actuation of said person, and said decision means can register as said 
face image for the input face image extracted by said face field extract means between fixed periods after the detection 
while said detection means of operation has detected said predetermined actuation to register with said dictionary 
means. 

[Claim 4] The image input step which inputs said person's image in the face image registration approach for registering 
a person's face image, The face field extract step which extracts the input face image which is an image of said person's 
face field out of the image inputted in said image input step, The face image registration approach characterized by the 
input face image extracted in said face field extract step consisting of a decision step which judges whether it is the face 
image in which said registration is possible, and a registration step into which the input face image judged that 
registration is possible in said decision step is made to register as said face image. 

[Claim 5] Said decision step is the face image registration approach according to claim 4 characterized by extracting the 
characteristic quantity of a face from the input face image extracted in said face field extract step, comparing this 
extracted characteristic quantity with the criteria set up beforehand, and judging that registration is possible if said 
characteristic quantity is said beyond criteria. 

[Claim 6] It is the face image-registration approach according to claim 4 characterized by to judge whether it has the 
detection step of operation which detects predetermined actuation of said person, and the input face image extracted in 
said face field extract step between the fixed period after detection can register said decision step as said face image 
while having detected said predetermined actuation in said detection step of operation. 



[Translation done.] 
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DETAILED DESCRIPTION 



[Detailed Description of the Invention] 
[0001] 

[Field of the Invention] This invention relates to the face image registration equipment which extracts a face field out of 

the inputted image, and carries out dictionary registration of the face automatically, or its approach. 

[0002] 

[Description of the Prior Art] A face image recognition technique is a component engineering indispensable to 
construction of a human interface or a security system. Face image recognition consists of a face field extract, a face 
focus extract, and discernment. About the latest technical trend, reference [latest research trend [ of a Shiono **** 
Sanada / Hidehiko /" personal authentication technique ]", Shingaku Giho OSF92-17.] is detailed. The conventional face 
image identifying method can be classified into two of the roughly divided followings. 

[0003] One is the approach of calculating similarity with a candidate person's feature vector which parameterizes the 
location of the focus, such as an eye, a nose, and opening, a configuration, and size, generates a feature vector, and is 
registered beforehand. The person showing a dictionary vector with the highest similarity is discriminated from a person 
in question. These are classified into the structural-analysis-technique. 

[0004] Another approach is an approach based on the similarity of the pattern of the image by which a location and size 
were normalized by geometrical conversion of the two-dimensional affine conversion on the basis of the focus, such as 
a pupil and a nose, etc., and the dictionary normalization image registered beforehand. The person who expresses a 
dictionary image with the highest similarity like the former is discriminated from a person in question. These can be 
classified into the technique like a pattern according to the approach put in practical use in the conventional character 
recognition. 

[0005] In both [ of the two above-mentioned technique ] cases, statistical distance, such as a correlation value between 
images and Euclidean distance in the inside of a feature space, is used as similarity. Approaches, such as various pattern 
recognition theory put in practical use in character recognition, for example, a subspace method, [the Hidemitsu Ogawa 
written by ERUKKI OYA and Sato **** and "pattern recognition and subspace-method" Sangyo Tosho Publishing 
(1986)], and compound similarity, are applicable to this discernment processing. The configuration of the identifying 
method is detailed to [Funakubo ** "pattern recognition" KYORITSU SHUPPAN (1991)], [Taizo Iijima "pattern 
recognition theoretical" Morikita Shuppan (1989) etc.], etc. 

[0006] In addition to the above-mentioned face recognition processing, on the occasion of actual application of the 
above-mentioned face recognition, registration of a user, i.e., a registrant's face dictionary generation, becomes 
indispensable. For example, in the handwriting recognition of Chinese character using a subspace method, the 
dictionary was generated from the study sample image of hundreds of sheets for every alphabetic character. As for a 
three-dimension body like the target face, compared with an alphabetic character, large still a lot of [ fluctuation of 
configurations, such as lighting conditions, face sense, and expression change, or brightness ] study sample images are 
needed by this invention [Murase **, sherry NAIYA, "the three-dimension object recognition by two-dimensional 
collating", IEICE TRANSACTIONS (D-II) J77-D-II, 1 1, pp.2 179-2 187, and 1994.]. 

[0007] The conventional dictionary generation Reference [red Matsushige, Sasaki **, Akio Fukamachi, Yasuhito 
Suenaga, "the robust method of identifying a transverse-plane face by shade image matching", IEICE 
TRANSACTIONS (D-II), J76-DII, 7, pp. 1363- 1373, 1993.], [Yoshie Komatsu, Yasuo Ariki, "logging recognition of the 
face by the sense using a subspace method", PRU 95-191, pp.7- 14, 1996.], [-- M.Turk and A.P.Pentland : [ ] - "Face 
recognition using eigenfaces" ~ Proc.CVPR 11, pp.453-458, and 1993. --] - [Alex Pentland and Baback Moghaddam 
and Thad Starner, So that "View-based and modular eigenspaces for face recognition", CVPR'94, pp.84-91, 1994.], etc. 
may see The help was performing logging of a face and selection of an image from the image which turned and photoed 
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a registrant's face under good lighting conditions in the direction of predetermined, for example, a transverse plane, 15 
degrees of right and left, and 1 5 degree ** of vertical. 

[0008] A physiological change of the effect of the condition of the day, expression change and hair, and a mustache etc. 
tends to produce a face. Therefore, in order to realize a high recognition rate, it is necessary to update a face dictionary 
for every fixed period. However, the frequent renewal of a dictionary will raise a user's burden. So, in order to make 
face recognition easy to use, it becomes important how a burden is mitigated and registered without making a user 
conscious. 

[0009] For example, when it is interrupted temporarily and leaves the activity in an information terminal, the function to 
prevent others 1 access is considered, although using the dictionary generated beforehand is also considered - him - it is 
more effective to use the dictionary image generated as much as possible immediately before, in order to gather 
identification and the rate of others exclusion. However, in order to make a user's burden high, dictionary registration of 
carrying out dictionary registration at every **** is carried out without a user's making it conscious while accessing 
information machines and equipment. 

[0010] However, the study sample collected without making a registrant conscious contains the data which are not 
effective as study data the case where an eye is usually closed, when expression is changing a lot. If dictionary 
generation is performed from the study sample in which these error images are contained, the precision of a dictionary 
will fall. In order to raise dictionary precision, it is necessary to increase the number of study data, and real-time 
dictionary generation cannot be coped with in selection by the help. 
[0011] 

[Problem(s) to be Solved by the Invention] Conventionally, the help was performing generation of a face dictionary 
using predetermined, for example, a transverse plane, 15 degrees of right and left, and the face image that was made to 
turn a registrant's face in the direction for 15 degrees of every upper and lower sides, and was photoed in advance under 
good lighting conditions. A face changes with time amount. Therefore, in order to realize a high recognition rate, it is 
necessary to update a face dictionary for every fixed period. However, performing frequent renewal of a dictionary will 
raise a user's burden. 

[0012] About the automatic study sample collection for dictionary generation, if an approach [ finishing / application / 
already ] [Japanese Patent Application No. No. 61463 [ eight to ]] is applied, it is realizable. However, the technical 
problem that much data which are not effective in dictionary generation are contained remains in the study data 
collected without still making a registrant-ed conscious. In order to raise dictionary precision, it is necessary to increase 
the number of study data, and real-time dictionary generation is difficult to realize in selection by the help. 
[0013] Then, this invention provides a user with a burden, *♦**, the face image registration equipment that can 
eliminate registration of an unnecessary face image and can raise dictionary precision, and its approach. 
[0014] 

[Means for Solving the Problem] In the face image registration equipment which has a dictionary means by which this 
invention has registered a person's face image A face field extract means to extract the input face image which is an 
image of said person's face field out of the image inputted by image input means to input said person's image, and said 
image input means, A decision means to judge whether the input face image extracted by said face field extract means is 
a face image which can be registered into said dictionary means, It is face image registration equipment characterized by 
consisting of a registration means made to register into said dictionary means with said decision means by using as said 
face image the input face image judged that registration is possible. 

[0015] Moreover, this invention is set to the face image registration approach for registering a person's face image. The 
face field extract step which extracts the input face image which is an image of said person's face field out of the image 
inputted in the image input step which inputs said person's image, and said image input step, The decision step the input 
face image extracted in said face field extract step judges it to be whether it is the face image in which said registration 
is possible, It is the face image registration approach characterized by consisting of a registration step into which the 
input face image judged that registration is possible in said decision step is made to register as said face image. 
[0016] It judges whether it is effective as study data by comparing the criteria beforehand set up in the characteristic 
quantity which can be found from the face field extracted, for example as it is the above-mentioned invention. The 
image which normalized as characteristic quantity on the basis of the magnitude of the face focus, such as a pupil, a 
nostril, and **♦*, the amounts which can be found from positional information, or these points is used. It becomes 
possible to remove automatically the image from which expression changed a lot by this, the image which closed the 
eye, the image which opened opening from a study image. 

[0017] Furthermore predetermined actuation of mouse actuation, keyboard grabbing, button grabbing, etc. is detected, 
and dictionary generation is performed only for the image inputted during actuation. It decreases [ the time amount of 
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useless study data collection ] for being aimed only at under actuation and is efficient. Moreover, possibility of 
.collecting the images which are not effective as dictionaries in the case of having turned to width etc. also decreases. 
According to this invention, the equipment and the approach of extracting a face field out of the inputted image, and 
carrying out dictionary registration automatically are realizable. 

[0018] „ . 

[Embodiment of the Invention] Hereafter, one example of this invention is explained based on drawing 7 from drawing 

foOl 9] In information terminal units, such as a personal computer, in case under an activity is interrupted temporarily 
and a terminal is left, it is a required function to prevent others' access. Therefore, this example explains taking the case 
of the information-machines-and-equipment terminal unit 10 which carried the function to prevent others' access at the 
time of****. 

[0020] Drawi ng 1 is the block diagram showing the outline of the information terminal unit 10 with security concerning 
this example. 

[0021] This equipment 10 consists of the image input section 1 1, the face field extract section 12, the situation- 
recognition section 13, a private seal 1 14, the dictionary generation section 15, the access-control section 16, a 
dictionary image 17, and a predetermined actuation detecting element 18. 

[0022] (Image input section 1 1) The image input section 1 1 is for inputting the image of the person who becomes a 
candidate for recognition, for example, consists of a TV camera. The image 01 inputted from this image input section 1 1 
is digitized by the A/D converter, and is sent to the face field extract section 12. For example, a TV camera is installed 
in the lower part of the monitor of the information terminal unit 10. Or you may install in the rectangular head of a 

monitor. . 
[0023] (Face field extract section 12) The face field extract section 12 always continues extracting the lace field image 
02 from the input image sent from the image input section 1 1 . while moving the standard face image (template) 
registered beforehand over a full screen in this example - a correlation value - calculating - the highest correlation 
value - **** ~ let the field which is be a face field. Let the partial maximum point of a correlation value be a face field 
candidate. Suppose that a face does not exist, when lower than the threshold to which the correlation value in this face 
field candidate was set. It is set as 30, specifically using the maximum correlation value as 100 as a threshold. Since it 
corresponds to sense change of a face, if two or more templates are used by compound similarity etc., a face field can be 
further extracted to stability. This processing may be transposed to the extraction method based on the color information 
described previously. 

[0024] (Situation-recognition section 13) In the situation-recognition section 13, a user's condition (under an activity, 
*♦**, taking a seat) is discriminated from a time change of a face field extract result, and the change in dictionary 
generation mode and recognition mode is performed. 

[0025] It is set as recognition mode at the periods from TO to Tl the time of a face field beginning to be detected as 
shown in drawing 2 . the recognition processing described in this mode later -- him - identification processing is 
performed. It becomes accessible when identified him at the Tl time. 

[0026] When it changes to dictionary generation mode after Tl time and the dictionary generation signal from the 
predetermined actuation detecting element 18 is inputted, collection of study data is performed. It judges that it ****(ed) 
when it became impossible to have detected a fixed period face field, and a screen lock is started immediately. Or a 
screen lock may be started, after conveying the main point to a user with light, voice, etc. and seeing a user's reaction. 
Time of day Tl can be changed according to an object model or the class of access actuation here. 
[0027] (Recognition section 14) The recognition section 14 consists of face focus extract section 14a, normalization 
image generation section 14b, and pattern-matching section 14c. The block diagram of the recognition section 14 is 
shown in drawing! . . 
[0028] In face focus extract section 14a, the focus, such as a pupil, a nose, and ****, is extracted from the inside of the 
extracted face field. The approach [Japanese Patent Application No. No. 61463 [ eight to ]] which combined the 
configuration information for which it has already applied, and pattern information is applicable. 
[0029] The fundamental idea of this approach asks for the candidate of the focus using the configuration information 
that location precision is high, and verifies it by pattern matching. Since this approach positions using configuration 
information, it can expect a high location precision. Moreover, since matching which used the multi-template for 
selection of the right focus from a candidate group is applied, it is robust to fluctuation of the configuration brightness of 
the focus. Since pattern matching is carried out about processing speed only to the candidate who narrowed down with 
the degree-of-separation filter with little count cost, compared with the approach of carrying out pattern matching of the 
whole, drastic reduction of computational complexity is realizable. 
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0030] In addition to this Approach [A.L.Yuille based on edge information, "Feature extractionfrom faces using 
ieformable templates" and IJCV, vol.8 : 2 and pp.99-1 11,1 992.][Shizuo Sakamoto, the Miyao proton Joji Tajima, "a 
focus extract of the eye from a face image", IEICE TRANSACTIONS D-II, Vol.J76-D-II, No.8, PP-1796-1804 August, 
1993 1 and a proper space method Applied Eigen feature Method [Alex Pentland, Baback Moghaddam, Thad Stamer, 
ind "View-based and modular eigenspaces for face recognition", The approach [the Sasaki **** Akamatsu **, Yasuhito 
Suenaga, "the method of alignment of the face using the color information for face image recognition", IE 91-2, pp.9- 
1 5 and 1991.] based on CVPR'94, pp.84-91, 1994.], and color information is applicable. 

[0031] In normalization image generation section 14b, it normalizes on the basis of the focus. The example of the 
normalization processing on the basis of a pupil and a nostril is shown in drawing4 . The sense of vector El E2 is 
amended in parallel, further, from on cl and c2, the main coordinate of a normalization image and breadth are set up by 
2 twice vector E1E2, and a dip sets up the point CP of 1/3 by 2 twice vector clc2. 

[0032] In pattern-matching section 14c, it asks for pattern similarity as compared with a normalization image and the 
face image currently stored in the dictionary image. It identifies being him, when pattern similarity is higher than a 
reference value. In being smaller than criteria, it considers as others. As the approach of pattern matching, a correlation 
technique, a subspace method, a compound similarity method, etc. are applicable. 

[0033] (Predetermined actuation detecting element 18) The predetermined actuation detecting element 18 detects 
whether the user is performing predetermined actuation of a keyboard, a carbon button push, or actuation of a mouse. 
The dictionary generation section 1 5 is controlled based on a detection result. 

[0034] Since it is suitable to specifically carry out dictionary registration of the user performing predetermined actuation 
of keyboard grabbing, a carbon button push, actuation of a mouse, etc. since possibility that the transverse-plane image 
of a user's face is reflected to the image input means 1 1 is high when it detects, while performing predetermined 
actuation a dictionary generation signal is transmitted to the dictionary generation section 16. 
[0035] (Dictionary generation section 16) The dictionary generation section 16 consists of face focus extract section 
14a frame evaluation section 16a, normalization image generation section 14b, study image storage section 16b, and 
principal-component-analysis section 16c. With the recognition section 14, face focus extract section 14a and 
normalization image generation section 14b are common, and are used here. The block diagram of the dictionary 
generation section 16 is shown in drawing 5 . 

[0036] If a dictionary generation signal is inputted from the predetermined actuation detecting element 18, as tor 
dictionary generation, collection of study data will be performed. 

[0037] In frame evaluation section 16b, it judges whether the face field image 02 extracted from the information on the 
extracted face focus is effective in dictionary generation. Although three valuation bases are applied here, you may 
apply in individual or combination. 

[00381 (i) The physical relationship of each focus is specified beforehand, and when this physical relationship collapses, 
it is judged that it is not effective as study data. For example, they are both pupils, a nostril, the physical relationship of 
****, etc. 

T00391 (ii) The proper face image and pattern similarity which were stored in the dictionary image 17 are made 
evaluation When similarity is smaller than criteria, it is judged that it is not effective. The homogeneity of study data 
will be raised, so that criteria are made high. When too high, it becomes impossible however, to correspond to change ol 

[0040^7*6 dictionary image generated from the study image normalized by the right focus in advance instead of the 
proper face image stored in the dictionary image 1 7 may be used. 

[0041] If it furthermore takes into consideration to the face sense, it is possible to remove the face which turned to width 
extremely In this case, when the comparison with the face dictionary pattern currently beforehand prepared for every 
face sense of a direction 1 - N is performed and it is in agreement with the dictionary of the predetermined face sense, it 
is judged that it is effective in dictionary generation. 

[0042] (iii) The study data which investigated the relation of the focus extracted for every frame, and closed the eye are 
detected The coordinate of the focus extracted newly can be expressed by the linear combination of the coordinate of 
the focus already extracted with the frames 1-4 before that. Therefore, if the focus is correctly extracted with all frames, 
the error at the time of carrying out linear combination will become small. Conversely, an error becomes large, when an 
eye is closed and eyebrows and a pupil are mistaken. 
[0043] Drawin g 6 is used and explained to a detail about this. 

00441 If an orthographic projection model is assumed, the two-dimensional coordinate of the face focus seen from the 
direction of arbitration, without having a three-dimension model By the linear combination of ^<>-&™™™* 
coordinate of the focus matched on the image of four sheets as shown in a formula (1) and (2) [S.UUman which can be 
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expressed, and R. Basri: "Recognition by Linear Combinations of Models, IEEE Trans.PAMI, Vol.13, No.10, pp.992- 
1006, 1991.][Yasuhiro Mukogawa, Yuichi Nakamura, Tomoichi Ota, "generation of the face image of the direction of 
arbitration using two photographs of his face", ******, Vol.37, and No. 4, pp.635-644, and 1996. --] . The two- 
dimensional coordinate value (X, Y) of the focus newly detected from two-dimensional coordinate value [ of the right 
focus in four frames already extracted in the dynamic-image train using this property ] (xl and yl) - (x4 and y4) is 
verified. 

[0045] It asks for linear combination multiplier ai, bi, (i=l, and 4) with a least square method from the two-dimensional 
coordinate of the focus to the image of four frames, and a new coordinate (X, Y). The error Res which calculates (x\ y') 
for an approximation coordinate from the coupling coefficient obtained conversely (X, Y) is calculated shortly. 
[0046] 
[Equation 1] 

X-apXj +*j.x 2 + a 3' x 3 + a r x 4 (1) 
Y-bj.yj +b r y 2 + *> 3 ' y 3 +t> r y 4 (2) 
Rt t=J (x-x' ) * + (Y-y' ) 2 (3) 

Error Res shows the coordinate error of the new focus. Since the error has arisen in the extract location when an error is 
larger than a threshold, this data is not effective as study data. 

[0047] Above (i) - (iii) It is also possible to judge by whether distribution of the average luminance of a face field or 
brightness is contained in the criteria range besides the valuation basis. 

[0048] study image storage section 16b - the above - the study data judged to be effective are stored. If the stored 
number of study data reaches the number of regular, a study image will be sent to principal-component-analysis section 
16c. 

[0049] In principal-component-analysis section 16c, principal-component-analysis (KL expansion) application is carried 
out to the image data stored in study image storage section 16b, and it asks for a proper image. The proper image of a 
high order N individual is stored in the dictionary image 17 from the one where characteristic value is larger. 
[0050] Although the above-mentioned example explained the example which performs dictionary generation on real 
time from a dynamic image, it is also possible to carry out, once it stores in memory only the frame by which the 
detecting signal from a predetermined actuation detecting element came from the dynamic image. In this case, since 
possibility effective in dictionary generation is aimed only at a high frame, improvement in the speed and stability of 
processing can be measured. 

[0051] Moreover, although the subspace method which is the technique like a pattern is applied to discernment in the 
above-mentioned example, you may transpose to structural technique. 

[0052] (Explanation of actuation of this equipment 10) Next, actuation of this example is explained along with drawing 
7. 

[0053] The first access performs the usual log in which enters a password etc. Or it is also possible to perform an 
automatic log in using the dictionary of the user who registered in advance. 

[0054] In the face field extract section 12, a face field is extracted from the image first inputted from the image input 
section 1 1, and it is sent to the situation-recognition section 13. Moreover, in the predetermined actuation detecting 
element 18, predetermined actuation detecting signals, such as mouse actuation, keyboard grabbing, and button 
grabbing, are sent to the situation-recognition section. In the situation-recognition section 13, the change in recognition 
mode and dictionary generation mode is performed based on the time existence of a face field. Moreover, it controls 
whether study data are collected based on a predetermined actuation detecting signal. 

[0055] time of day TO to Tl sets it as recognition mode first - having - the last user - it identifies whether you are him. 

[0056] (i) A recognition mode face field is sent to face focus extract section 14a, and a pupil, a nostril, and **** are 
detected. Next, normalization is performed on the basis of these focus. Similarity with a user's dictionary image is 
calculated last time by pattern-matching section 14c, and when similarity is higher than a reference value, a 
normalization image is identified him who was using it last time, and becomes accessible. Access is forbidden when 
smaller than a reference value. 

[0057] Next, from time of day Tl , it is set as dictionary generation mode. 

[0058] (ii) When the dictionary generation mode predetermined actuation detecting element 18 detects predetermined 
actuation and the predetermined actuation detecting signal is turned on, study data are collected and a dictionary image 
is generated. The face field image extracted from the image inputted continuously is sent to face focus extract section 
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14a, and a pupil, a nostril, and **** are detected. The extracted focus information is sent to frame evaluation section 
16a[ for example, an error is calculated from a formula (3). When an error is smaller than a threshold, it is judged that it 
is effective as study data, and it is stored in study image storage section 1 6b. If the study data more than convention 
number of sheets are stored in study image storage section 16b, study data will be sent to principal-component-analysis 
section 1 6c, and a dictionary image will be generated. 

[0059] If a user ****, a screen lock will be started immediately. If a user takes a seat again, it will return to the first 

processing. . 
[0060] In addition, in the above-mentioned explanation, when the predetermined actuation detecting element 18 
detected predetermined actuation and the predetermined actuation detecting signal was turned on, the dictionary image 
was generated, but further, after a predetermined actuation detecting signal suits at OFF, a dictionary image may be 
continuously generated between fixed periods (for example, for 1 minute). 

[0061] (Example of modification) Next, the example in service of not only the security of an information terminal but 
an information terminal is explained. 

[0062] For example, when registering a face, it considers memorizing to coincidence not only the characteristic quantity 
obtained from a face field but the actuation hysteresis of an information terminal, while operating discernment of the 
human being, simultaneously a terminal the information on a face, simultaneously by registering, it can extract from 
hysteresis what kind of information was searched for. 

[0063] If information retrieval is made into an example, a user will do the activity which inputs a retrieval type etc. In 
addition to information required for identification, the hysteresis of the retrieval type and the contents of retrieval is 
stored in the database. Based on the stored data, the user's interesting contents are presumed and extracted and it holds 
by performing new information retrieval. 

[0064] And again, when a user performs a log in to an information terminal, and an actuation return, the service ot him 
which offers automatically information considered to be the request newly searched to the terminal is attained. 
[0065] By linking such hysteresis information with the registration information on a face, and being stored, it may 
become instead of the time stump when to have retrieved such information, it may recognize whether it is close to the 
condition of the face at the time of when based on this, and the information based on the hysteresis information on of 
those days may be shown. 

[0066] Moreover, as a verification device in the case of registration of a face, if expression detection and recognition are 
used, it is also possible to take the actuation hysteresis of the information according to human being's feeling condition, 
and the dictionary at the time of usual, the dictionary when laughing, the dictionary when being angry, etc. can also be 
generated. In case face recognition is newly performed, information offer corresponding to expression information may 

be performed. . . 

[0067] When these perform continuously renewal of face registration, and renewal of information actuation hysteresis to 

coincidence, clearer information offer is attained. 

[0068] Although these examples explained the information terminal to the example, they may be transposed, for 
example to home electronics, such as TV and a telephone. 

[0069] In the case of TV, a subminiature camera is carried in remote control. Predetermined actuation is defined as the 
actuation which is pushing the carbon button of remote control. The actuation which in the case of a telephone micro 
CCD carries in an earphone and raises an earphone is defined as predetermined actuation. 

[0070] Furthermore, the equipment 10 of this example may be used for an automatic teller's machine (henceforth ATM). 

[0071] namely, him, a user,, by using this equipment 10 for a check, it forgets be based on others and the unauthorized 
use of usurpation of cash etc. can be prevented certainly. In this case, the image input section 1 1 is arranged near the 
control panel of ATM, and it is made for a user's face to be reflected certainly. Moreover, the account number and 
updating registration time of day may be registered into a face image and coincidence. 
[0072] 

[Effect of the Invention] As mentioned above, a new face image is extracted out of the image which was inputted 
according to this invention, in order to register only the input face image which judged whether it was the face image 
with which this deserves dictionary registration, and was judged to be effective, registration of an unnecessary face 
image can be eliminated and dictionary precision can be raised. 
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imiUBl ¥fi£9^2£25H 

[#1*91*0118 J 
■fSHBlMMHStttea^r . 

mm&\Ji*mz£ ^xx^tuzmm^t^mLK 

iamb. 

1WBH*U*ttffl¥«fc J: ->r ttiai$*i*: A*£B&#. 

mmmmm^mzx ■oxm&ztifzxjiwmm^ 

LX. 

m§mmmnmiiitx'htnmmmb mti 
z b zmib^m $m i sftttgiBftsgsg. 

mmmrnt. 

mmmm-mfiimbfe<r>w$*m&Lx^hwi. 
d t zmib-?2>m?m i iE®o««g^ia. 

MIBB&A^r >vTlz&^XXjlZtitzWm*frt>1fi 

iiMacomffi£<r>m&x'b h xwmfSk 5-«iai-r h mm 
mm*rvTb. 

7b. 

mmm*Tv7£te^x%mmbwmtit:xiim 
mmzmmsiSLb ix^^ith^m^yr^^ 
zzbtmbttmrnimwim. 



Lr. 

imm&tmizmMiX'hix&m^mbmrtz 
zbimtbi-imm4im,<om^mum. 

mmsimiti^yy-izii^xmim^mm^iiii 
rv^s. ttM. zotiitm^&mnwuz. mem 
mmiftZTvrizto^xmizziitixijmwmmi 
mmmb txm^m^t^imhz b sr^stt-r 

[0001 ] 

mmwt tzuwKmizmz . 

[0002] 

[ft*<9&ffi] SB0f2P^(i:. h A -vW^7 

=or»«SE»Tft*. mmmimt. mwmtn. mm 

ft [JS»3t, KfflUg "fflAI2iEa»^S3fi^5?iS 
ft" . fl^Sffi 0SF92-17. ] jWBU*. flg#*>IIB«18 
**<4Ht4iaT»2ofc:4H«r'**. 

[0003] i, a. p*if^)W8Ui«^)fia. 

Yti*mmz%xbm\tz>. ztiummm 
ttmizfrmztiz. 

[ 0 0 04 ] t ? loco^i. IS, S^ri:<om^5r 
^X*jE«fl:$<lfcB«i: : W>e«$*irv^Sf 

SE&ffcBflii: -yoamsnsv^^r-j) 

s. mbmtmttmmjm^&mmgiz&t/M 

tix^zttx-w-ymttiizmx'**. 
[ooo5] _lib<7) 2 oo^a^) fc'*> o^-^ t . g& 
LriiB«^fflM€ J ^m^+r^^--^ U >y 

*4=B«r*ltfflfl:$iir v^«« ^^-yBNStt. 

6) ] ^mizmwstt: b'imimmvz & . assoacoaus 

{± "-»V->W ftiffillS (1991) ] [IS 

mm u w-ymmm m mtmm (i989> ] 
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LtfWVt, S'xtf "2<fc7c!S£t;:J:43 
<fc7EIMWHr . ft^ti (D-II) J77-0-II, 11. PP. 21 
79-2187, 1994.] . 
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it. "&tt£Wkzm^t:foziz£c > %^m(D®>)i}ii, 

88" , PRU95-191, pp. 7-14, 1996.] . [M. Turk, 
A. P. Pentland: " Face recognition using eigenface 
s " . Proc. CVPR 11. pp. 453-458, 1993. ] . [Alex 
Pentland, Baback Moghaddam, Thad Starner, " View-ba 
sed and modular eigenspaces for face recognition " 
, CVPR '94, pp.84-91, 1994. ] %b'lzM.t>tl& <* 0 
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[ 0 0 1 0 3 b Z ZtfgS&SlzMmZtt^X-Wlk Itz 

v-fiisAx^h. zti^WkOmm^iiix^ 
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*] tm~m&mx'$&. t*u imtixwa 
mizwmzit%^x'WikiK^T-?izimm£s$L 
tzmx%^T-ftf&®&tti& b^mwmh. 
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m*). Amziz&ttxitvTiu?4J>®m&m s n 
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IZblzi O^r-^i: LXGWfrvWWiZftd . » 
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mi 2, #fflfgi§sgsi3. bi i4. sm&fiBi 
5.79 tzrnmm 1 6 . mm • 1 7 . m^»^«iai 
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[0029] £<o^*)**«fr#;ltt. 

- h wtr? i+v?*m Lx^htdhizws^ 
mm&nymiztt lxuj <x bx-hi. mmeuzm 

Mzm&nLxnwf-y&^hwx-kmw 
-vmGi-zimiz&^n&n^KtwMtfmix't 

[0030] ^WffitCt. JLyi;fflMZ&*3<1jm [A. 
L.Yuille, " Feature extractionfrom faces using de 
formable templates " , IJCV. vol. 8: 2. pp. 99-11 

i, 1992.] mm=f-. mmc, "ms& 

a>^go#®£**ar . mm d-h, v 0 i. j76-o-i 

I. No. 8. pp. 1796-1804, August, 1993. ] *S»ffi*ffi 
O^SriiffltitEigen feature ft [Alex Pentland.Baba 
ck Moghaddamjhad Starner, " View-based and modula 
r eigenspaces for face recognition ", CVPR '94, p 
P. 84-91. 1994.] . *7-m.\iZWr}<im [fe^* 

, 1E91-2. pp. 9-15, 1991. 

] tfmm*imx'h&. 

[0031] lEMitnmjm 1 4 bx-i4. 
mzixiEm-itzmt. mizm. stf&mwzi,t:JE 

mittSm<7)MZ*-f. h^E 1 E 20[fiJ^ ^TOt: 
«IELT$^lCcl, c20±*^l/3^CP?:iE 
«4iaMW»*"MaiR. «MB4^ h/UE 1 E 202fg. 
mmi*? h/Uc 1 c 2tf)2fglCi££-f 6. 
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w&t&mwmzmt t>tix^hwm&t)m.Lxw 
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[00 33] ( ^Sftf^HigB 1 8 ) ©f^s&f^mgp 1 

a. 

[00343 JlttWfctt. H«*. # 

#^ft^^^»4^a5 1 6 (caww s . 
[oo35] mm£jmi6)mmL$Mi6\*. m 
mjmta^ ua.7 ^-^img? i6a. smm 

14b. ^^M^IEItSS 1 6 b Rtf ±j8*MMfr 
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fcBte£j£« 1 4 b (4. 1218315 1 4 fc&jTCffiffl-r * . 
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[00443 JESrtJt f^^fisr*-* . 3 856*7^ 
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h [S. Ullman, R. Basri: "Recognition by Linear Co 
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- abb*:-. ^tkomnzm^miimm 

. -fff5ftf&. Vol. 37, No. 4. pp. 635-644, 1 

996.] . zmmmmvxmmmiz&^xmizMiii 

ZhX^Z4&<D7]/-Alzm&lEl\*®®)!i<7)2d: 
7C^Sfi(x! , y, )~(x 4 , y, ) *>£>frL<}fc 
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tWLT (X, Y) fc^MHResfcttStf-*. 
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*»nt*08m. &->x^z>t$mm%t'££m-& 
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[0067] zixwmmmMtm^kmmviw. 
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